Ladder Filter Power Spectrum Estimator Power Spectrum Estimator Smoothing Filter Mean Spectral Energy Difference Smoothing Filter Threshold Detector Further Process Rippling Level Estimator

نویسندگان

  • R. Martínez
  • A. Álvarez
  • P. Gómez
  • M. Pérez
  • V. Nieto
  • V. Rodellar
چکیده

The determination of the precise moment in which speech begins or ends is an important problem in ASR. As showed in [1], small separations from the optimum beginning and ending point, imply a great decrease in the recognition accuracy. The presence of noise [2] [3], specially when its level is high (around 95 dB as in the case of this work), and its characteristics are highly nonstationary, is an added problem, since it can produce false shots (more probable when the noise includes speech sounds). That is the reason why in such conditions, it is important to have a pre-processing stage that removes as much noise as is possible, and that gives some clues that help to build an end-point detector for those environments. The method here presented offers a pre-processing technique for highly noisy and non stationary environments, which at the same time that enhances the speech, gives an equalised version of the SNR improvement (Mean Spectral Energy Difference), whose main characteristic is that large differences in the level of noise are changed to a little ripple, while the presence of speech is distinguished by a large decrease in this Mean Spectral Energy Difference. Following this technique, any End-point Detection approach (explicit, implicit or hybrid [3]) may render acceptable results.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Ladder Filter Power Spectrum Estimator Power Spectrum Estimator Smoothing Filter Mean Spectral Energy Difference Smoothing Filter Threshold Detector Further Process Rippling Level Estimator Time Domain Power Estimator Decision Rule

The determination of the precise moment in which speech begins or ends is an important problem in ASR. As showed in [1], small separations from the optimum beginning and ending point, imply a great decrease in the recognition accuracy. The presence of noise [2] [3], specially when its level is high (around 95 dB as in the case of this work), and its characteristics are highly nonstationary, is ...

متن کامل

Lag-windowing and multiple-data-windowing are roughly equivalent for smooth spectrum estimation

There is no fundamental difference between lag-windowing a correlation sequence and multiple-windowing a data sequence when the objective is to reduce the mean-squared error of a spectrum estimator. By analyzing the approximate low-rank factorization of a bandlimiting Toeplitz operator, we find that lag-windowed (or spectrally smoothed) spectrum estimators have multiple-data-windowed implementa...

متن کامل

A Noise Estimator with Rapid Adaptation in Variable-Level Noisy Environments

In this paper, a noise estimator with rapid adaptation in a variable-level noisy environment is presented. To make noise estimation adapt quickly to highly non-stationary noise environments, a robust voice activity detector (VAD) is utilized in this paper and it depends on the variation of the spectral energy not on the amount of that. The noise power spectrum in subbands are estimated by avera...

متن کامل

Emg Amplitude Estimation: a Review of the past and a Look towards the Future

AND INTRODUCTION The amplitude of the surfa.ce EMG is frequently used as the control input to myoelectric prostheses, as a measure of muscular effort, and has also been investigated as an indicator of muscle force This paper will review the methods which are used to estimate the EMG amplitude from recordings of the EMG waveform. (Note that this review does not include the related area of EMG-to...

متن کامل

Lightweight Filter Architecture for Energy Efficient Mobile Vehicle Localization Based on a Distributed Acoustic Sensor Network

The generic properties of an acoustic signal provide numerous benefits for localization by applying energy-based methods over a deployed wireless sensor network (WSN). However, the signal generated by a stationary target utilizes a significant amount of bandwidth and power in the system without providing further position information. For vehicle localization, this paper proposes a novel proximi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1997